Abstract: This paper addresses the problem of channel estimation in
multi-cell interference-limited cellular networks. We consider systems
employing multiple antennas and are interested in both the finite and
large-scale antenna number regimes (so-called "massive MIMO"). Such
systems deal with the multi-cell interference by way of per-cell
beamforming applied at each base station. Channel estimation in such
networks, which is known to be hampered by the pilot contamination
effect, constitutes a major bottleneck for overall performance. We
present a novel approach which tackles this problem by enabling a
low-rate coordination between cells during the channel estimation phase
itself. The coordination makes use of the additional second-order
statistical information about the user channels, which is shown to offer
a powerful way of discriminating across interfering users with even
strongly correlated pilot sequences. Importantly, we demonstrate
analytically that in the large-number-of-antennas regime, the pilot
contamination effect is made to vanish completely under certain
conditions on the channel covariance. Gains over the conventional
channel estimation framework are confirmed by our simulations for even
small antenna array sizes.

Index Terms — massive MIMO, pilot contamination, channel 
estimation, scheduling, covariance information. 



I. Introduction 

Full reuse of the frequency across neighboring cells leads to 
severe interference, which in turn limits the quality of service 
offered to cellular users, especially those located at the cell 
edge. As service providers seek solutions to restore
performance in low-SINR cell locations, several approaches
aimed at mitigating inter-cell interference have emerged in the 
last few years. Among these, the solutions which consist of 
exploiting the additional degrees of freedom made available 
by the use of multiple antennas seem the most promising, 
particularly so at the base station side where such arrays are 
more easily affordable. 

In an effort to solve this problem while limiting the re- 
quirements for user data sharing over the backhaul network, 
coordinated beamforming approaches have been proposed in 
which 1) multiple-antenna processing is exploited at each base 
station, and 2) the optimization of the beamforming vectors at 

Manuscript received February 1, 2012; revised June 15, 2012. This work
was supported in part by National Natural Science Foundation of China
under grant No. 60972015 and 61231007, by the European research project
SAPHYRE under the FP7 ICT Objective 1.1 - The Network of the Future,
and by the French national ANR-VERSO funded project LICORNE.

H. Yin was with Huazhong University of Science and Technology, Wuhan, 
430074 China. He is now with EURECOM, 06410 Biot, France (e-mail: 
yin@eurecom.fr). 

D. Gesbert, M. Filippou are with EURECOM, 06410 Biot, France (e-mail: 
gesbert@eurecom.fr, filippou@eurecom.fr). 

Y. Liu is with Huazhong University of Science and Technology, Wuhan,
430074 China (e-mail: liuyz@mail.hust.edu.cn).



all cooperating base stations is performed jointly. Unlike network
MIMO, coordinated beamforming does not require the exchange of user
message information. Yet it still demands the exchange of channel state
information (CSI) across the transmitters on a fast time scale,
low-latency basis, making it almost as challenging to implement in
practice as the above-mentioned network MIMO schemes.

Fortunately a path towards solving some of the essential 
practical problems related to beamforming-based interference 
avoidance was suggested in [1]. In this work, it was pointed
out that the need for exchanging Channel State Information at 
Transmitter (CSIT) between base stations could be alleviated 
by simply increasing the number of antennas, M, at each 
transmitter (so-called massive MIMO). This result is rooted in 
the law of large numbers, which predicts that, as the number of 
antennas increases, the vector channel for a desired terminal 
will tend to be more orthogonal to the vector channel of a 
randomly selected interfering user. This makes it possible to
reject interference at the base station side by simply aligning 
the beamforming vector with the desired channel ("Maximum 
Ratio Combining" or spatial matched filter). Hence in theory, 
a simple fully distributed per-cell beamforming scheme can 
offer performance scaling (with M) similar to a more complex 
centralized optimization. 

Unfortunately, the above conclusion only holds without 
pilot-contaminated CSI estimates. In reality, channel informa- 
tion is acquired on the basis of finite-length pilot sequences, 
and crucially, in the presence of inter-cell interference. Therefore,
the pilot sequences from neighboring cells would contaminate each other.
It was pointed out in [1] that pilot contamination constitutes a
bottleneck for performance. In particular, it has been shown that pilot
contamination effects [2], [3], [4] (i.e., the reuse of non-orthogonal
pilot sequences across
interfering cells) cause the interference rejection performance 
to quickly saturate with the number of antennas, thereby 
undermining the value of MIMO systems in cellular networks. 

In this paper, we address the problem of channel estimation 
in the presence of multi-cell interference generated from 
pilot contamination. We propose an estimation method which 
provides a substantial improvement in performance. It relies 
on two key ideas. The first is the exploitation of dormant 
side-information lying in the second-order statistics of the 
user channels, both for desired and the interfering users. In 
particular, we demonstrate a powerful result indicating that the 
exploitation of covariance information under certain subspace 
conditions on the covariance matrices can lead to a complete 
removal of pilot contamination effects in the large $M$ limit. We
then turn to a practical algorithm design where this concept 



is exploited. The key idea behind the new algorithm is the 
use of a covariance-aware pilot assignment strategy within 
the channel estimation phase itself. While diversity-based 
scheduling methods have been popularized for maximizing 
various throughput-fairness performance criteria [5], [6], [7], [8],
the potential benefit of user-to-pilot assignment in the context of
interference-prone channel estimation has received very little
attention so far.

More specifically, our contributions are the following: We 
first develop a Bayesian channel estimation method making 
explicit use of covariance information in the inter-cell inter- 
ference scenario with pilot contamination. We show that the 
channel estimation performance is a function of the degree 
to which dominant signal subspaces pertaining to the desired 
and interference channel covariance overlap with each other.
Therefore we exploit the fact that the desired user signals and
interfering user signals are received at the base station with
(at least approximately) finite-rank covariance matrices. This
is typically the case in realistic scenarios due to the limited
angle spread followed by incoming paths originating from
street-level users [9]. Finally, we propose a pilot sequence
assignment strategy based on assigning carefully selected 
groups of users to identical pilot sequences. The gains are 
shown to depend on system parameters such as the typical 
angle spread measured at the base station and the number of 
base station antennas. Performance close to the interference- 
free channel estimation scenario is obtained for moderate 
numbers of antennas and users. 

The notations adopted in the paper are as follows. We 
use boldface to denote matrices and vectors. Specifically, $\mathbf{I}_M$
denotes the $M \times M$ identity matrix. Let $(\mathbf{X})^T$, $(\mathbf{X})^*$, and
$(\mathbf{X})^H$ denote the transpose, conjugate, and conjugate transpose
of a matrix $\mathbf{X}$ respectively. $E\{\cdot\}$ denotes the expectation, $\|\cdot\|_F$
denotes the Frobenius norm, and $\mathrm{diag}\{\mathbf{a}_1, \ldots, \mathbf{a}_N\}$ denotes a
diagonal matrix or a block diagonal matrix with $\mathbf{a}_1, \ldots, \mathbf{a}_N$ at
the main diagonal. The Kronecker product of two matrices $\mathbf{X}$
and $\mathbf{Y}$ is denoted by $\mathbf{X} \otimes \mathbf{Y}$. $\triangleq$ is used for definitions.

II. Signal and Channel Models 

We consider a network of $L$ time-synchronized^1 cells, with
full spectrum reuse. Estimation of (block-fading) channels in
the uplink is considered,^2 and all the base stations are equipped
with $M$ antennas. To simplify the notations, we assume the 1st
cell is the target cell, unless otherwise stated. We assume the
pilots, of length $\tau$, used by single-antenna users in the same
cell are mutually orthogonal. As a result, intra-cell interference
is negligible in the channel estimation phase. However, non-orthogonal
(possibly identical) pilots are reused from cell to
cell, resulting in pilot contamination from $L - 1$ interfering
cells. For ease of exposition, we consider the case where a 
single user per cell transmits its pilot sequence to its serving 

^1 Note that assuming synchronization between uplink pilots provides a worst
case scenario from a pilot contamination point of view, since any lack of
synchronization will tend to statistically decorrelate the pilots.

^2 Similar ideas would be applicable for downlink channel estimation,
provided the UE is equipped with multiple antennas as well, in which case 
the estimation would help resolve interferences originating from neighboring 
base stations. 



base. The pilot sequence used in the $l$-th cell is denoted by:

$$\mathbf{s}_l = [\, s_{l1} \;\; s_{l2} \;\; \cdots \;\; s_{l\tau} \,]^T. \tag{1}$$

The powers of the pilot sequences are assumed equal such that
$|s_{l1}|^2 + \cdots + |s_{l\tau}|^2 = \tau$, $l = 1, 2, \ldots, L$.

The channel vector between the $l$-th cell user and the target
base station is $\mathbf{h}_l$. Thus, $\mathbf{h}_1$ is the desired channel while $\mathbf{h}_l$, $l > 1$,
are interference channels. All channel vectors are assumed
to be $M \times 1$ complex Gaussian, undergoing correlation due to
the finite multipath angle spread at the base station side [10]:

$$\mathbf{h}_l = \mathbf{R}_l^{1/2} \mathbf{h}_{Wl}, \quad l = 1, 2, \ldots, L, \tag{2}$$

where $\mathbf{h}_{Wl} \sim \mathcal{CN}(\mathbf{0}, \mathbf{I}_M)$ is the spatially white $M \times 1$ SIMO
channel, and $\mathcal{CN}(\mathbf{0}, \mathbf{I}_M)$ denotes the zero-mean complex Gaussian
distribution with covariance matrix $\mathbf{I}_M$. In this paper, we
make the assumption that the covariance matrix $\mathbf{R}_l \triangleq E\{\mathbf{h}_l \mathbf{h}_l^H\}$
can be obtained separately from the desired and interference
channels (see Section VI for how this could be done in
practice).

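During the pilot phase, the $M \times \tau$ signal received at the
target base station is

$$\mathbf{Y} = \sum_{l=1}^{L} \mathbf{h}_l \mathbf{s}_l^T + \mathbf{N}, \tag{3}$$

where $\mathbf{N} \in \mathbb{C}^{M \times \tau}$ is the spatially and temporally white
additive Gaussian noise (AWGN) with zero mean and element-wise
variance $\sigma_n^2$.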
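The model (1)-(3) can be sketched numerically as follows. This is a minimal illustration only: the dimensions, the noise variance, the random covariance matrices and the unit-modulus pilots are assumptions chosen for the example, not values taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
M, L, tau = 8, 3, 10      # antennas, cells, pilot length (illustrative values)
sigma_n2 = 0.1            # element-wise noise variance (assumed)

def crandn(*shape):
    """Unit-variance circularly symmetric complex Gaussian samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

def sqrtm_psd(Rl):
    """Hermitian square root of a positive semi-definite matrix."""
    w, U = np.linalg.eigh(Rl)
    return (U * np.sqrt(np.clip(w, 0, None))) @ U.conj().T

# Placeholder covariance matrices R_l (random Hermitian PSD, hypothetical choice).
R = []
for _ in range(L):
    A = crandn(M, M)
    R.append(A @ A.conj().T / M)

# Unit-modulus pilots, so that |s_l1|^2 + ... + |s_l,tau|^2 = tau as required below (1).
S_pilots = [np.exp(2j * np.pi * rng.random(tau)) for _ in range(L)]

# Correlated channels h_l = R_l^{1/2} h_Wl, equation (2).
h = [sqrtm_psd(R[l]) @ crandn(M) for l in range(L)]

# Received pilot block Y = sum_l h_l s_l^T + N, equation (3).
N = np.sqrt(sigma_n2) * crandn(M, tau)
Y = sum(np.outer(h[l], S_pilots[l]) for l in range(L)) + N
```

Note that `np.outer` does not conjugate its arguments, which matches the plain transpose $\mathbf{s}_l^T$ in (3).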



III. Covariance-Based Channel Estimation

A. Pilot Contamination 

Conventional channel estimation relies on correlating the
received signal with the known pilot sequence (referred to here as
the Least Squares (LS) estimate). Hence, using the model
in (3), an LS estimator for the desired channel
$\mathbf{h}_1$ is

$$\hat{\mathbf{h}}_1^{\mathrm{LS}} = \mathbf{Y} \mathbf{s}_1^* (\mathbf{s}_1^T \mathbf{s}_1^*)^{-1}. \tag{4}$$



The conventional estimator suffers from a lack of orthogonality
between the desired and interfering pilots, an effect known as
pilot contamination [2], [11], [12]. In particular, when the same
pilot sequence is reused in all $L$ cells, i.e., $\mathbf{s}_1 = \cdots = \mathbf{s}_L = \mathbf{s}$,
the estimator can be written as

$$\hat{\mathbf{h}}_1^{\mathrm{LS}} = \mathbf{h}_1 + \sum_{l \neq 1}^{L} \mathbf{h}_l + \mathbf{N} \mathbf{s}^* / \tau. \tag{5}$$



As it appears in (5), the interfering channels leak directly
into the desired channel estimate. The estimation performance
is then limited by the signal-to-interference ratio at the base
station, which in turn limits the ability to design effective
interference-avoiding beamforming solutions.
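The leakage described by (5) can be checked numerically. The sketch below assumes spatially white channels and arbitrary illustrative dimensions; it only verifies that the LS estimate (4) under a shared pilot equals the desired channel plus every interfering channel plus filtered noise.

```python
import numpy as np

rng = np.random.default_rng(1)
M, L, tau = 4, 3, 10   # illustrative sizes, not taken from the paper

def crandn(*shape):
    """Unit-variance circularly symmetric complex Gaussian samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

s = np.exp(2j * np.pi * rng.random(tau))      # shared pilot with ||s||^2 = tau
h = [crandn(M) for _ in range(L)]             # white channels, for illustration only
N = 0.1 * crandn(M, tau)
Y = sum(np.outer(hl, s) for hl in h) + N      # model (3) with s_1 = ... = s_L = s

# LS estimate (4): Y s^* (s^T s^*)^{-1}
h_ls = Y @ s.conj() / (s @ s.conj())

# Closed form (5): desired channel + all interfering channels + filtered noise
h_ls_closed = h[0] + sum(h[1:]) + N @ s.conj() / tau

contamination = h_ls - h[0]                   # what the LS estimate gets wrong
```

The interfering channels enter `contamination` with unit gain, so adding antennas or lengthening the shared pilot does not remove them.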



B. Bayesian Estimation 

We hereby propose an improved channel estimator with the 
aim of reducing the pilot contamination effect, and taking 
advantage of the multiple antenna dimensions. We suggest 
to do so by exploiting side information lying in the second 
order statistics of the channel vectors. The role of covariance 
matrices is to capture structure information related to the 
distribution (mainly mean and spread) of the multipath angles 
of arrival at the base station. Due to the typically elevated 
position of the base station, rays impinge on the antennas with 
a finite angle-of-arrival (AOA) spread and a user location-dependent
mean angle. Note that covariance-aided channel
estimation itself is not a novel idea, e.g., in [13]. In [14],
the authors focus on optimal design of pilot sequences and
they exploit the covariance matrices of desired channels and
colored interference. The optimal training sequences were
developed with adaptation to the statistics of disturbance. In
our paper, however, the pilot design is shown not to have an
impact on interference reduction, since fully aligned pilots
are transmitted. Instead, we focus on i) studying the limiting
behavior of covariance-based estimates in the presence of 
interference and large-scale antenna arrays, and ii) how to 
shape covariance information for the full benefit of channel 
estimation quality. 

Two Bayesian channel estimators can be formed. In the first, 
all channels are estimated at the target base station (including 
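interfering ones). In the second, only $\mathbf{h}_1$ is estimated. By
vectorizing the received signal and noise, our model (3) can
be represented as

$$\mathbf{y} = \mathbf{S} \mathbf{h} + \mathbf{n}, \tag{6}$$

where $\mathbf{y} = \mathrm{vec}(\mathbf{Y})$, $\mathbf{n} = \mathrm{vec}(\mathbf{N})$, and $\mathbf{h} \in \mathbb{C}^{LM \times 1}$ is obtained
by stacking all $L$ channels into a vector. The pilot matrix $\mathbf{S}$
is defined as

$$\mathbf{S} \triangleq [\, \mathbf{s}_1 \otimes \mathbf{I}_M \;\; \cdots \;\; \mathbf{s}_L \otimes \mathbf{I}_M \,]. \tag{7}$$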

Applying Bayes' rule, the conditional distribution of the
channels $\mathbf{h}$ given the received training signal $\mathbf{y}$ is

$$p(\mathbf{h}|\mathbf{y}) = \frac{p(\mathbf{h}) \, p(\mathbf{y}|\mathbf{h})}{p(\mathbf{y})}. \tag{8}$$



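We use the multivariate Gaussian probability density function
(PDF) of the random vector $\mathbf{h}$ and assume its component vectors $\mathbf{h}_1, \cdots, \mathbf{h}_L$
are mutually independent, giving the joint PDF:

$$p(\mathbf{h}) = \frac{\exp\left(-\sum_{l=1}^{L} \mathbf{h}_l^H \mathbf{R}_l^{-1} \mathbf{h}_l\right)}{\pi^{LM} (\det \mathbf{R}_1 \cdots \det \mathbf{R}_L)^M}. \tag{9}$$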



Note that we derive this Bayesian estimator under the stan- 
dard condition of covariance matrix invertibility, although we 
show later this hypothesis is actually challenged by reality 
in the large-number-of-antennas regime. Fortunately, our final 
expressions for channel estimators completely skip the covari- 
ance inversion. 

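Using (6), we may obtain:

$$p(\mathbf{y}|\mathbf{h}) = \frac{\exp\left(-(\mathbf{y} - \mathbf{S}\mathbf{h})^H (\mathbf{y} - \mathbf{S}\mathbf{h}) / \sigma_n^2\right)}{(\pi \sigma_n^2)^{M\tau}}. \tag{10}$$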



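Combining the equations (9) and (10), the expression of (8)
can be rewritten as

$$p(\mathbf{h}|\mathbf{y}) = \frac{\exp(-l(\mathbf{h}))}{AB}, \tag{11}$$

where $A \triangleq (\pi \sigma_n^2)^{M\tau}$, $B \triangleq \pi^{LM} (\det \mathbf{R}_1 \cdots \det \mathbf{R}_L)^M = \pi^{LM} (\det \mathbf{R})^M$, and

$$l(\mathbf{h}) \triangleq \mathbf{h}^H \bar{\mathbf{R}} \mathbf{h} + (\mathbf{y} - \mathbf{S}\mathbf{h})^H (\mathbf{y} - \mathbf{S}\mathbf{h}) / \sigma_n^2, \tag{12}$$

in which $\mathbf{R} \triangleq \mathrm{diag}(\mathbf{R}_1, \cdots, \mathbf{R}_L)$ and $\bar{\mathbf{R}} \triangleq \mathbf{R}^{-1}$.

Using the maximum a posteriori (MAP) decision rule, the
Bayesian estimator yields the most probable value given the
observation $\mathbf{y}$ [15]:

$$\hat{\mathbf{h}} = \arg\max_{\mathbf{h}} \, p(\mathbf{h}|\mathbf{y}) = \arg\min_{\mathbf{h}} \, l(\mathbf{h}) = (\sigma_n^2 \mathbf{I}_{LM} + \mathbf{R} \mathbf{S}^H \mathbf{S})^{-1} \mathbf{R} \mathbf{S}^H \mathbf{y}. \tag{13}$$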



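Interestingly, the Bayesian estimate as shown in (13) coincides
with the minimum mean square error (MMSE) estimate,
which has the form

$$\hat{\mathbf{h}}^{\mathrm{MMSE}} = \mathbf{R} \mathbf{S}^H (\mathbf{S} \mathbf{R} \mathbf{S}^H + \sigma_n^2 \mathbf{I}_{\tau M})^{-1} \mathbf{y}. \tag{14}$$

(13) and (14) are equivalent thanks to the matrix inversion
identity $(\mathbf{I} + \mathbf{A}\mathbf{B})^{-1} \mathbf{A} = \mathbf{A} (\mathbf{I} + \mathbf{B}\mathbf{A})^{-1}$.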
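The equivalence of (13) and (14) can be confirmed on a small random instance. The sketch below builds the pilot matrix (7) with `np.kron` and a block-diagonal covariance; all sizes and the random inputs are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
M, L, tau, sigma_n2 = 3, 2, 4, 0.5   # small illustrative sizes

def crandn(*shape):
    """Unit-variance circularly symmetric complex Gaussian samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

# Block-diagonal covariance R = diag(R_1, ..., R_L) with random PSD blocks.
R = np.zeros((L * M, L * M), dtype=complex)
for l in range(L):
    A = crandn(M, M)
    R[l * M:(l + 1) * M, l * M:(l + 1) * M] = A @ A.conj().T

# Pilot matrix (7): S = [s_1 kron I_M, ..., s_L kron I_M], shape (tau*M, L*M).
pilots = [crandn(tau) for _ in range(L)]
S = np.hstack([np.kron(s.reshape(-1, 1), np.eye(M)) for s in pilots])
SH = S.conj().T

y = crandn(tau * M)   # any observation vector works for checking the identity

# MAP form (13) and MMSE form (14)
h_map = np.linalg.solve(sigma_n2 * np.eye(L * M) + R @ SH @ S, R @ SH @ y)
h_mmse = R @ SH @ np.linalg.solve(S @ R @ SH + sigma_n2 * np.eye(tau * M), y)
```

Form (13) inverts an $LM \times LM$ matrix while (14) inverts a $\tau M \times \tau M$ one, so the cheaper choice depends on whether $L$ or $\tau$ is larger.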



C. Channel Estimation with Full Pilot Reuse 

Previously we have given expressions whereby interfering
channels are estimated simultaneously with the desired channel.
This could be of use in designing zero-forcing type receivers.
Although Zero-Forcing (ZF) type (or other sophisticated) receivers
would clearly give better performance
at finite $M$ (see [3] for an analysis of this problem), in this
paper we focus on simple matched filters, since
such filters are made more relevant by the use of massive
MIMO. Matched filters require the knowledge of the desired
channel only, so that interference channels can be considered
as nuisance parameters. For this case, the single user channel
estimation shown below can be used. For ease of exposition,
the worst case situation with a unique pilot sequence reused
in all $L$ cells is considered:

$$\mathbf{s} = [\, s_1 \;\; s_2 \;\; \cdots \;\; s_\tau \,]^T. \tag{15}$$



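Similar to (7), we define a training matrix $\bar{\mathbf{S}} \triangleq \mathbf{s} \otimes \mathbf{I}_M$. Note
that $\bar{\mathbf{S}}^H \bar{\mathbf{S}} = \tau \mathbf{I}_M$. Then the vectorized received training signal
at the target base station can be expressed as

$$\mathbf{y} = \bar{\mathbf{S}} \sum_{l=1}^{L} \mathbf{h}_l + \mathbf{n}. \tag{16}$$

Since the Bayesian estimator and the MMSE estimator are
identical, we omit the derivation and simply give the expression
of this estimator for the desired channel $\mathbf{h}_1$ only:

$$\hat{\mathbf{h}}_1 = \mathbf{R}_1 \bar{\mathbf{S}}^H \left( \bar{\mathbf{S}} \left( \sum_{l=1}^{L} \mathbf{R}_l \right) \bar{\mathbf{S}}^H + \sigma_n^2 \mathbf{I}_{\tau M} \right)^{-1} \mathbf{y} \tag{17}$$

$$\phantom{\hat{\mathbf{h}}_1} = \mathbf{R}_1 \left( \sigma_n^2 \mathbf{I}_M + \tau \sum_{l=1}^{L} \mathbf{R}_l \right)^{-1} \bar{\mathbf{S}}^H \mathbf{y}. \tag{18}$$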



Note that the MMSE channel estimation in the presence of 
identical pilots is also undertaken in other works such as 0. 

In the section below, we examine the degradation caused 
by the pilot contamination on the estimation performance. In 
particular, we point out the role played by the use of covariance 
matrices in dramatically reducing the pilot contamination 
effects under certain conditions on the rank structure. 

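We are interested in the mean squared error (MSE) of
the proposed estimators, which can be defined as: $\mathcal{M} \triangleq E\{\|\hat{\mathbf{h}} - \mathbf{h}\|_F^2\}$,
or for the single user channel estimate $\mathcal{M}_1 \triangleq E\{\|\hat{\mathbf{h}}_1 - \mathbf{h}_1\|_F^2\}$.

The estimation MSE of (13) is

$$\mathcal{M} = \mathrm{tr}\left\{ \mathbf{R} \left( \mathbf{I}_{LM} + \frac{\mathbf{S}^H \mathbf{S}}{\sigma_n^2} \mathbf{R} \right)^{-1} \right\}. \tag{19}$$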



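Specifically, when identical pilots are used in all cells, the
MSEs are

$$\mathcal{M} = \mathrm{tr}\left\{ \mathbf{R} \left( \mathbf{I}_{LM} + \frac{\tau \mathbf{J}_{LL} \otimes \mathbf{I}_M}{\sigma_n^2} \mathbf{R} \right)^{-1} \right\}, \tag{20}$$

$$\mathcal{M}_1 = \mathrm{tr}\left\{ \mathbf{R}_1 - \mathbf{R}_1^2 \left( \frac{\sigma_n^2}{\tau} \mathbf{I}_M + \sum_{l=1}^{L} \mathbf{R}_l \right)^{-1} \right\}, \tag{21}$$

where $\mathbf{J}_{LL}$ is an $L \times L$ matrix consisting of all 1s. The
derivations to obtain $\mathcal{M}$ and $\mathcal{M}_1$ use standard methods and
the details are omitted here due to lack of space. However,
similar methods can be found in [16]. It is clear
from (20) and (21) that the MSE does not depend on the
specific design of the pilot sequence, but only on its power.
We can readily obtain the channel estimate of (18) in an
interference-free scenario, by setting interference terms to
zero: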



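$$\hat{\mathbf{h}}_1^{\mathrm{no\,int}} = \mathbf{R}_1 (\sigma_n^2 \mathbf{I}_M + \tau \mathbf{R}_1)^{-1} \bar{\mathbf{S}}^H (\bar{\mathbf{S}} \mathbf{h}_1 + \mathbf{n}), \tag{22}$$

where the superscript no int refers to the "no interference
case", and the corresponding MSE is:

$$\mathcal{M}_1^{\mathrm{no\,int}} = \mathrm{tr}\left\{ \mathbf{R}_1 \left( \mathbf{I}_M + \frac{\tau}{\sigma_n^2} \mathbf{R}_1 \right)^{-1} \right\}. \tag{23}$$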
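To make the role of the covariance subspaces in (21) and (23) concrete, the following sketch evaluates both expressions on a toy example. The rank-2 covariance matrices, the dimension and the SNR values are hypothetical choices for illustration; the function names are not from the paper.

```python
import numpy as np

def mse_desired(R_list, sigma_n2, tau):
    """MSE (21) for the desired channel (first entry of R_list) under full pilot reuse."""
    R1 = R_list[0]
    M = R1.shape[0]
    inv = np.linalg.inv(sigma_n2 / tau * np.eye(M) + sum(R_list))
    return np.trace(R1 - R1 @ R1 @ inv).real

def mse_no_interference(R1, sigma_n2, tau):
    """Interference-free MSE (23)."""
    M = R1.shape[0]
    return np.trace(R1 @ np.linalg.inv(np.eye(M) + tau / sigma_n2 * R1)).real

# Toy setup: rank-2 covariances built from an orthonormal basis of a 6-dim space.
M, tau, sigma_n2 = 6, 10, 0.1
Q, _ = np.linalg.qr(np.random.default_rng(3).standard_normal((M, M)))
R1 = Q[:, :2] @ Q[:, :2].T            # desired user occupies basis vectors 0-1
R2_orth = Q[:, 2:4] @ Q[:, 2:4].T     # interferer in an orthogonal subspace
R2_same = R1.copy()                   # interferer sharing the desired subspace

mse_free = mse_no_interference(R1, sigma_n2, tau)
mse_orth = mse_desired([R1, R2_orth], sigma_n2, tau)
mse_same = mse_desired([R1, R2_same], sigma_n2, tau)
```

In this toy case the orthogonal interferer leaves the MSE at its interference-free value, while the co-located interferer raises it substantially, which is the behavior the large-scale analysis formalizes.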



D. Large Scale Analysis 

We seek to analyze the performance for the above estimators 
in the regime of large antenna number M. For tractability, our 
analysis is based on the assumption of a uniform linear array
(ULA) with supercritical antenna spacing (i.e., less than or 
equal to half wavelength). 



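Hence we have the following multipath model^3

$$\mathbf{h}_i = \frac{1}{\sqrt{P}} \sum_{p=1}^{P} \mathbf{a}(\theta_{ip}) \alpha_{ip}, \tag{24}$$

where $P$ is the arbitrary number of i.i.d. paths, $\alpha_{ip} \sim \mathcal{CN}(0, \delta_i^2)$
is independent over channel index $i$ and path index
$p$, where $\delta_i$ is the $i$-th channel's average attenuation. $\mathbf{a}(\theta)$ is
the steering vector, as shown in [17]

$$\mathbf{a}(\theta) \triangleq \begin{bmatrix} 1 \\ e^{-j 2\pi \frac{D}{\lambda} \cos(\theta)} \\ \vdots \\ e^{-j 2\pi \frac{(M-1) D}{\lambda} \cos(\theta)} \end{bmatrix}, \tag{25}$$

where $D$ is the antenna spacing at the base station and $\lambda$
is the signal wavelength, such that $D \leq \lambda/2$. $\theta_{ip} \in [0, \pi]$
is a random AOA. Note that we can limit angles to $[0, \pi]$
because any $\theta \in [-\pi, 0]$ can be replaced by $-\theta$ giving the
same steering vector.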

Below, we momentarily assume that the selected users
exhibit multipath AOAs that do not overlap with the AOAs for
the desired user, i.e., the AOA spread and user locations are
such that multipath for the desired user is confined to a region
of space where interfering paths are very unlikely to exist.
Although the asymptotic analysis below makes use of this
condition, it will be shown in Section IV how such a structure
can be shaped implicitly by the coordinated pilot assignment.
Finally, simulations reveal in Section V the robustness with
respect to an overlap between AOA regions of desired and
interference channels (for instance in the case of Gaussian
AOA distribution).

Our main result is as follows: 

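Theorem 1. Assume the multipath angle of arrival $\theta$ yielding
channel $\mathbf{h}_j$, $j = 1, \ldots, L$, in (24), is distributed according to
an arbitrary density $p_j(\theta)$ with bounded support, i.e., $p_j(\theta) = 0$
for $\theta \notin [\theta_j^{\min}, \theta_j^{\max}]$ for some fixed $\theta_j^{\min} \leq \theta_j^{\max} \in [0, \pi]$. If
the $L - 1$ intervals $[\theta_i^{\min}, \theta_i^{\max}]$, $i = 2, \ldots, L$ are strictly
non-overlapping with the desired channel's AOA interval^4
$[\theta_1^{\min}, \theta_1^{\max}]$, we have

$$\lim_{M \to \infty} \hat{\mathbf{h}}_1 = \hat{\mathbf{h}}_1^{\mathrm{no\,int}}. \tag{26}$$

Proof: From the channel model (24), we get

$$\mathbf{R}_i = \frac{\delta_i^2}{P} \sum_{p=1}^{P} E\{\mathbf{a}(\theta_{ip}) \mathbf{a}(\theta_{ip})^H\} = \delta_i^2 E\{\mathbf{a}(\theta_i) \mathbf{a}(\theta_i)^H\},$$

where $\theta_i$ has the PDF $p_i(\theta)$ for all $i = 1, \ldots, L$. The proof of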
Theorem 1 relies on three intermediate lemmas which exploit 
the eigenstructures of the covariance matrices. The proofs of 
the lemmas are given in the appendix. The essential ingredient 
is to exhibit an asymptotic (at large M) orthonormal vector 

^3 Note that the Gaussian model (2) can well approximate the multipath
model (24) as long as there are enough paths. Since the number of elementary
paths is typically very large, we have $P \gg 1$ and this assumption is valid in
practice.

^4 This condition is just one example of a practical scenario leading to
non-overlapping signal subspaces between the desired and the interference
covariances; more general multipath scenarios could be used.



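basis for $\mathbf{R}_i$ constructed from steering vectors at regularly
sampled spatial frequencies.

Lemma 1. Define $\boldsymbol{\alpha}(x) \triangleq [\, 1 \;\; e^{-j\pi x} \;\; \cdots \;\; e^{-j\pi (M-1) x} \,]^T$
and $\mathcal{A} \triangleq \mathrm{span}\{\boldsymbol{\alpha}(x), x \in [-1, 1]\}$. Given $b_1, b_2 \in [-1, 1]$ and
$b_1 < b_2$, define $\mathcal{B} \triangleq \mathrm{span}\{\boldsymbol{\alpha}(x), x \in [b_1, b_2]\}$, then

• $\dim\{\mathcal{A}\} = M$

• $\dim\{\mathcal{B}\} \sim (b_2 - b_1) M / 2$ when $M$ grows large.

Proof: See Appendix A. □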

Lemma 1 characterizes the dimension of the linear
space spanned by $\boldsymbol{\alpha}(x)$, in which $x$ plays the
role of spatial frequency.

Lemma 2. With a bounded support of AOAs, the rank of
channel covariance matrix $\mathbf{R}_i$ satisfies:

$$\frac{\mathrm{rank}(\mathbf{R}_i)}{M} \leq d_i, \quad \text{as } M \to \infty,$$

where $d_i$ is defined as

$$d_i \triangleq \left( \cos(\theta_i^{\min}) - \cos(\theta_i^{\max}) \right) \frac{D}{\lambda}.$$

Proof: See Appendix B. □

Lemma 2 indicates that for large $M$, there exists a null
space $\mathrm{null}(\mathbf{R}_i)$ of dimension $(1 - d_i) M$. Interestingly, related
eigenstructure properties of the covariance matrices were independently
derived in [18] for the purpose of reducing the
overhead of downlink channel estimation and CSI feedback in
massive MIMO for FDD systems.

Lemma 3. The null space $\mathrm{null}(\mathbf{R}_i)$ includes a certain set of
unit-norm vectors:

$$\mathrm{null}(\mathbf{R}_i) \supset \mathrm{span}\left\{ \frac{\mathbf{a}(\Phi)}{\sqrt{M}}, \; \forall \Phi \notin [\theta_i^{\min}, \theta_i^{\max}] \right\}, \quad \text{as } M \to \infty.$$

Proof: See Appendix C. □

This lemma indicates that multipath components with AOA
outside the AOA region for a given user will tend to fall in
the null space of its covariance matrix in the large-number-of-antennas
case.
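Lemmas 2 and 3 can be illustrated numerically at a finite array size. The sketch below is only a finite-$M$ illustration of asymptotic statements: it assumes a uniform AOA density on [80°, 100°], half-wavelength spacing and unit attenuation, approximates the expectation by averaging over a dense angle grid, and uses an arbitrary relative threshold to count dominant eigenvalues.

```python
import numpy as np

def steering(theta, M, spacing=0.5):
    """ULA steering vector (25); `spacing` is D / lambda."""
    return np.exp(-2j * np.pi * spacing * np.arange(M) * np.cos(theta))

def covariance_uniform_aoa(theta_min, theta_max, M, spacing=0.5, n_grid=2000):
    """E{a(theta) a(theta)^H} for theta uniform on [theta_min, theta_max],
    approximated by averaging over a dense grid of angles."""
    thetas = np.linspace(theta_min, theta_max, n_grid)
    A = np.stack([steering(t, M, spacing) for t in thetas], axis=1)   # M x n_grid
    return A @ A.conj().T / n_grid

M, spacing = 64, 0.5
theta_min, theta_max = np.deg2rad(80), np.deg2rad(100)
R = covariance_uniform_aoa(theta_min, theta_max, M, spacing)

# Lemma 2: the number of dominant eigenvalues is roughly d * M.
d = (np.cos(theta_min) - np.cos(theta_max)) * spacing
eigvals = np.linalg.eigvalsh(R)[::-1]                 # descending order
n_dominant = int(np.sum(eigvals > 1e-3 * eigvals[0]))
energy_top = eigvals[: 2 * int(np.ceil(d * M))].sum() / eigvals.sum()

# Lemma 3: a steering vector pointing outside the AOA support is nearly in null(R).
a_out = steering(np.deg2rad(30), M, spacing) / np.sqrt(M)
a_in = steering(np.deg2rad(90), M, spacing) / np.sqrt(M)
leak_out = (a_out.conj() @ R @ a_out).real
leak_in = (a_in.conj() @ R @ a_in).real

print(f"d*M = {d * M:.1f}, dominant eigenvalues = {n_dominant}")
print(f"out-of-support / in-support energy = {leak_out / leak_in:.2e}")
```

At finite $M$ the eigenvalue count exceeds $d_i M$ by a few transition eigenvalues, so the comparison should be read as approximate.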

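We now return to the proof of Theorem 1. $\mathbf{R}_i$ can be
decomposed into

$$\mathbf{R}_i = \mathbf{U}_i \boldsymbol{\Sigma}_i \mathbf{U}_i^H, \tag{27}$$

where $\mathbf{U}_i$ is the signal eigenvector matrix of size $M \times m_i$, in
which $m_i \leq d_i M$. $\boldsymbol{\Sigma}_i$ is an eigenvalue matrix of size $m_i \times m_i$.
Due to Lemma 3 and the fact that densities $p_i(\theta)$ and $p_1(\theta)$
have non-overlapping supports, we have

$$\mathbf{U}_i^H \mathbf{U}_1 = \mathbf{0}, \; \forall i \neq 1, \quad \text{as } M \to \infty. \tag{28}$$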



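Combining the channel estimate (18) and the channel model
(16), we obtain

$$\hat{\mathbf{h}}_1 = \mathbf{R}_1 \left( \sigma_n^2 \mathbf{I}_M + \tau \sum_{l=1}^{L} \mathbf{R}_l \right)^{-1} \bar{\mathbf{S}}^H \left( \bar{\mathbf{S}} \sum_{l=1}^{L} \mathbf{h}_l + \mathbf{n} \right).$$

According to (28), matrices $\mathbf{R}_1$ and $\sum_{l=2}^{L} \mathbf{R}_l$ span orthogonal
subspaces in the large $M$ limit. Therefore we place ourselves
in the asymptotic regime for $M$, when $\tau \sum_{l=2}^{L} \mathbf{R}_l$ can be
eigen-decomposed according to

$$\tau \sum_{l=2}^{L} \mathbf{R}_l = \mathbf{W} \boldsymbol{\Sigma} \mathbf{W}^H, \tag{29}$$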



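where $\mathbf{W}$ is the eigenvector matrix such that $\mathbf{W}^H \mathbf{W} = \mathbf{I}$
and $\mathrm{span}\{\mathbf{W}\}$ is included in the orthogonal complement of
$\mathrm{span}\{\mathbf{U}_1\}$. Now denote by $\mathbf{V}$ the unitary matrix corresponding
to the orthogonal complement of both $\mathrm{span}\{\mathbf{W}\}$ and
$\mathrm{span}\{\mathbf{U}_1\}$, so that the $M \times M$ identity matrix can now be
decomposed into:

$$\mathbf{I}_M = \mathbf{U}_1 \mathbf{U}_1^H + \mathbf{W} \mathbf{W}^H + \mathbf{V} \mathbf{V}^H. \tag{30}$$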

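Thus, for large $M$,

$$\hat{\mathbf{h}}_1 \sim \mathbf{U}_1 \boldsymbol{\Sigma}_1 \mathbf{U}_1^H \left( \tau \mathbf{U}_1 \boldsymbol{\Sigma}_1 \mathbf{U}_1^H + \mathbf{W} \boldsymbol{\Sigma} \mathbf{W}^H + \sigma_n^2 \mathbf{U}_1 \mathbf{U}_1^H + \sigma_n^2 \mathbf{V} \mathbf{V}^H + \sigma_n^2 \mathbf{W} \mathbf{W}^H \right)^{-1} \left( \tau \sum_{l=1}^{L} \mathbf{h}_l + \bar{\mathbf{S}}^H \mathbf{n} \right).$$

Due to asymptotic orthogonality between $\mathbf{U}_1$, $\mathbf{W}$ and $\mathbf{V}$,

$$\hat{\mathbf{h}}_1 \sim \mathbf{U}_1 \boldsymbol{\Sigma}_1 (\sigma_n^2 \mathbf{I}_{m_1} + \tau \boldsymbol{\Sigma}_1)^{-1} \mathbf{U}_1^H \left( \tau \sum_{l=1}^{L} \mathbf{h}_l + \bar{\mathbf{S}}^H \mathbf{n} \right)$$

$$\phantom{\hat{\mathbf{h}}_1} \sim \mathbf{U}_1 \boldsymbol{\Sigma}_1 (\sigma_n^2 \mathbf{I}_{m_1} + \tau \boldsymbol{\Sigma}_1)^{-1} \left( \tau \mathbf{U}_1^H \mathbf{h}_1 + \tau \sum_{l=2}^{L} \mathbf{U}_1^H \mathbf{h}_l + \mathbf{U}_1^H \bar{\mathbf{S}}^H \mathbf{n} \right).$$

However, since $\mathbf{h}_i \in \mathrm{span}\{\mathbf{a}(\theta), \forall \theta \in [\theta_i^{\min}, \theta_i^{\max}]\}$, we have
from Lemma 3 that

$$\frac{\|\mathbf{U}_1^H \mathbf{h}_i\|}{\|\mathbf{U}_1^H \mathbf{h}_1\|} \to 0, \quad \text{for } i \neq 1 \text{ when } M \to \infty.$$

Therefore

$$\lim_{M \to \infty} \hat{\mathbf{h}}_1 = \mathbf{U}_1 \boldsymbol{\Sigma}_1 (\sigma_n^2 \mathbf{I}_{m_1} + \tau \boldsymbol{\Sigma}_1)^{-1} \left( \tau \mathbf{U}_1^H \mathbf{h}_1 + \mathbf{U}_1^H \bar{\mathbf{S}}^H \mathbf{n} \right),$$

which is identical to $\hat{\mathbf{h}}_1^{\mathrm{no\,int}}$ if we apply the EVD decomposition
(27) for $\mathbf{R}_1$ in (22). This proves Theorem 1. □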

We also believe that, although antenna calibration is needed
as a technical assumption in the theorem, orthogonality of covariance
signal subspaces will occur in non-tightly calibrated
settings provided the AOA regions do not overlap.
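The trend stated by Theorem 1 can be observed through the MSE expressions (21) and (23). The sketch below assumes two users with uniform, non-overlapping AOA supports ([80°, 100°] for the desired user, [30°, 50°] for the interferer), half-wavelength spacing, and illustrative values of the pilot length and noise variance; it reports the gap between the two MSEs, normalized by the channel power, for growing $M$.

```python
import numpy as np

def cov_uniform(theta_min_deg, theta_max_deg, M, spacing=0.5, n_grid=1000):
    """E{a(theta) a(theta)^H} with theta uniform on the given interval (degrees)."""
    thetas = np.deg2rad(np.linspace(theta_min_deg, theta_max_deg, n_grid))
    A = np.exp(-2j * np.pi * spacing * np.outer(np.arange(M), np.cos(thetas)))
    return A @ A.conj().T / n_grid

def mse_with_interference(R_list, sigma_n2, tau):
    """MSE (21); the first matrix in R_list belongs to the desired user."""
    R1 = R_list[0]
    inv = np.linalg.inv(sigma_n2 / tau * np.eye(R1.shape[0]) + sum(R_list))
    return np.trace(R1 - R1 @ R1 @ inv).real

def mse_interference_free(R1, sigma_n2, tau):
    """MSE (23)."""
    return np.trace(R1 @ np.linalg.inv(np.eye(R1.shape[0]) + tau / sigma_n2 * R1)).real

tau, sigma_n2 = 10, 0.1   # illustrative values
gaps = {}
for M in (4, 32, 128):
    R1 = cov_uniform(80, 100, M)    # desired user
    R2 = cov_uniform(30, 50, M)     # interferer with non-overlapping AOA support
    gap = mse_with_interference([R1, R2], sigma_n2, tau) - mse_interference_free(R1, sigma_n2, tau)
    gaps[M] = gap / np.trace(R1).real

for M, g in gaps.items():
    print(f"M = {M:3d}: normalized MSE gap = {g:.2e}")
```

The gap is never negative, since interference cannot improve the MMSE estimate, and it shrinks as the array grows.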

IV. Coordinated Pilot Assignment 

We have seen from above that the performance of the 
covariance-aided channel estimation is particularly sensitive 
to the degree with which the signal subspaces of covariance 
matrices for the desired and the interference channels overlap 
with each other. In the ideal case where the desired and
the interference covariances span distinct subspaces, we have 
demonstrated that the pilot contamination effect tends to 
vanish in the large-antenna-array case. In this section, we 
make use of this property by designing a suitable coordination 
protocol for assigning pilot sequences to users in the L 
cells. The role of the coordination is to optimize the use 
of covariance matrices in an effort to try and satisfy the 
non-overlapping AOA constraint of Theorem 1. We assume 
that in all L cells, the considered pilot sequence will be 
assigned to one (out of K) user in each of the L cells. Let 



$\mathcal{G} \triangleq \{1, \ldots, K\}$, then $\mathcal{K}_l \in \mathcal{G}$ denotes the index of the user
in the $l$-th cell who is assigned the pilot sequence $\mathbf{s}$. The set
of selected users is denoted by $\mathcal{U}$ in what follows.

We use the estimation MSE (21) as a performance metric
to be minimized in order to find the best user set. (20) is
an alternative if we take the estimates of interfering channels
into consideration. For a given user set $\mathcal{U}$, we define a network
utility function



TABLE I 
Basic simulation parameters 



$$F(\mathcal{U}) \triangleq \sum_{j=1}^{|\mathcal{U}|} \frac{\mathcal{M}_j(\mathcal{U})}{\mathrm{tr}\{\mathbf{R}_{jj}(\mathcal{U})\}}, \tag{31}$$

where $|\mathcal{U}|$ is the cardinal number of the set $\mathcal{U}$. $\mathcal{M}_j(\mathcal{U})$ is the
estimation MSE for the desired channel at the $j$-th base station,
with a notation readily extended from $\mathcal{M}_1$ in (21), where this
time cell $j$ is the target cell when computing $\mathcal{M}_j$. $\mathbf{R}_{jj}(\mathcal{U})$ is
the covariance matrix of the desired channel at the $j$-th cell.
The principle of the coordinated pilot assignment consists
in exploiting covariance information at all cells (a total of
$K L^2$ covariance matrices) in order to minimize the sum MSE
metric. Hence, $L$ users are assigned an identical pilot sequence
when the corresponding $L^2$ covariance matrices exhibit the
most orthogonal signal subspaces. Note that the MSE-based
criterion (31) implicitly exploits the property of subspace
orthogonality, e.g., at high SNRs, the proposed MSE-based
criterion will be minimized by choices of users with covariance
matrices showing maximum signal subspace orthogonality,
thereby implicitly satisfying the conditions behind Theorem
1. In view of minimizing the search complexity, a classical
greedy approach is proposed:

1) Initialize $\mathcal{U} = \emptyset$

2) For $l = 1, \ldots, L$ do:

$\mathcal{K}_l = \arg\min_{k \in \mathcal{G}} F(\mathcal{U} \cup \{k\})$

$\mathcal{U} \leftarrow \mathcal{U} \cup \{\mathcal{K}_l\}$

End

The coordination can be interpreted as follows: To minimize 
the estimation error, a base station tends to assign a given 
pilot to the user whose spatial feature has most differences 
with the interfering users assigned the same pilot. Clearly, 
the performance will improve with the number of users, as it 
becomes more likely to find users with discriminable second- 
order statistics. 
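The greedy procedure above can be sketched as follows. This is a minimal illustration under stated assumptions: the data layout `R[j][l][k]` (covariance of the channel from user `k` of cell `l` to base station `j`), the function names, and the randomly generated toy covariances are all hypothetical; the utility follows (31) with the per-cell MSE computed from (21).

```python
import numpy as np

def utility(selected, R, sigma_n2, tau):
    """Utility (31) for the users chosen so far; selected[l] is the user picked in cell l.
    R[j][l][k] is the covariance of the channel from user k of cell l to base station j."""
    total = 0.0
    for j, kj in enumerate(selected):
        Rjj = R[j][j][kj]
        M = Rjj.shape[0]
        R_sum = sum(R[j][l][kl] for l, kl in enumerate(selected))
        inv = np.linalg.inv(sigma_n2 / tau * np.eye(M) + R_sum)
        mse_j = np.trace(Rjj - Rjj @ Rjj @ inv).real    # (21) with cell j as target
        total += mse_j / np.trace(Rjj).real
    return total

def greedy_assignment(R, K, sigma_n2, tau):
    """Pick one user per cell, cell by cell, each time minimizing F(U union {k})."""
    selected = []
    for _ in range(len(R)):
        costs = [utility(selected + [k], R, sigma_n2, tau) for k in range(K)]
        selected.append(int(np.argmin(costs)))
    return selected

# Toy covariances: 20-degree uniform AOA spread around a random mean angle (assumed).
rng = np.random.default_rng(4)

def toy_cov(M):
    theta = np.deg2rad(rng.uniform(20, 160) + np.linspace(-10, 10, 200))
    A = np.exp(-1j * np.pi * np.outer(np.arange(M), np.cos(theta)))   # D = lambda / 2
    return A @ A.conj().T / theta.size

L, K, M, tau, sigma_n2 = 2, 4, 16, 10, 0.1
R = [[[toy_cov(M) for _ in range(K)] for _ in range(L)] for _ in range(L)]
assignment = greedy_assignment(R, K, sigma_n2, tau)
```

The greedy search evaluates $KL$ candidate sets instead of the $K^L$ sets of an exhaustive search, at the cost of not guaranteeing the global minimum of (31).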



V. Numerical Results 

In order to preserve fairness between users and avoid having 
high-SNR users being systematically assigned the considered 
pilot, we consider a symmetric multicell network where the 
users are all distributed on the cell edge and have the same
distance from their base stations. In practice, users with greater
average SNR levels (but equal across cells) can be assigned
together on a separate pilot pool. We adopt the model of a
cluster of synchronized and hexagonally shaped cells. Some
basic simulation parameters are given in Table I. We keep
these parameters in the following simulation, unless otherwise 
stated. 



Parameter                        Value
Cell radius                      1 km
Cell edge SNR                    20 dB
Number of users per cell         10
Distance from a user to its BS   800 m
Path loss exponent               3
Carrier frequency                2 GHz
Antenna spacing                  $\lambda/2$
Number of paths                  50
Pilot length                     10



The channel vector between the $u$-th user in the $l$-th cell
and the target base station is

$$\mathbf{h}_{lu} = \frac{1}{\sqrt{P}} \sum_{p=1}^{P} \mathbf{a}(\theta_{lup}) \alpha_{lup}, \tag{32}$$

where $\theta_{lup}$ and $\alpha_{lup}$ are the AOA and the attenuation of the
$p$-th path between the $u$-th user in the $l$-th cell and the target
base station respectively. Note that the variance of $\alpha_{lup}$, $\forall p$, is
$\delta_{lu}^2$, which includes the distance-based path loss $\beta_{lu}$ between
the user and the target base station (which can be any one of
the $L$ cells):

$$\beta_{lu} = \frac{\alpha}{d_{lu}^{\gamma}}, \tag{33}$$

where $\alpha$ is a constant dependent on the prescribed average
SNR at cell edge, $d_{lu}$ is the geographical distance, and $\gamma$ is the
path-loss exponent.

Two types of AOA distributions are considered here, a non- 
bounded one (Gaussian) and a bounded one (uniform): 

1) Gaussian distribution: For the channel coefficients $\mathbf{h}_{lu}$,
the AOAs of all $P$ paths are i.i.d. Gaussian random variables
with mean $\bar{\theta}_{lu}$ and standard deviation $\sigma$. Here we suppose
all the desired channels and interference channels have the
same standard deviation of AOA. Note that Gaussian AOA
distributions cannot fulfill the conditions of non-overlapping
AOA support domains in Theorem 1; nevertheless, the use of
the proposed method in this context also gives substantial gains
as $\sigma^2$ decreases.

2) Uniform distribution: For the channel $\mathbf{h}_{lu}$, the AOAs
are uniformly distributed over $[\bar{\theta}_{lu} - \theta_\Delta, \bar{\theta}_{lu} + \theta_\Delta]$, where $\bar{\theta}_{lu}$
is the mean AOA.
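A channel realization following (32) with either AOA distribution can be generated as sketched below. The function names and the numeric values of the constant and distance in (33) are hypothetical, and the sketch assumes the path variance equals the path loss alone, which is a simplification of the statement that the variance "includes" it.

```python
import numpy as np

def steering(theta, M, spacing=0.5):
    """ULA steering vector (25); `spacing` is D / lambda."""
    return np.exp(-2j * np.pi * spacing * np.arange(M) * np.cos(theta))

def draw_aoas(mean_deg, spread_deg, P, kind, rng):
    """AOAs in radians: Gaussian (spread = standard deviation) or uniform (spread = half-width)."""
    if kind == "gaussian":
        return np.deg2rad(rng.normal(mean_deg, spread_deg, P))
    if kind == "uniform":
        return np.deg2rad(rng.uniform(mean_deg - spread_deg, mean_deg + spread_deg, P))
    raise ValueError(f"unknown AOA distribution: {kind}")

def multipath_channel(thetas, delta2, M, rng):
    """Channel (32): sum of P steering vectors with CN(0, delta2) path gains, scaled by 1/sqrt(P)."""
    P = thetas.size
    alpha = np.sqrt(delta2 / 2) * (rng.standard_normal(P) + 1j * rng.standard_normal(P))
    A = np.stack([steering(t, M) for t in thetas], axis=1)   # M x P
    return A @ alpha / np.sqrt(P)

rng = np.random.default_rng(5)
M, P = 16, 50                      # P = 50 paths as in Table I; M is illustrative
path_loss = 1.0 / 800.0 ** 3       # (33) with a placeholder constant of 1, d = 800 m, exponent 3
thetas_uni = draw_aoas(90, 10, P, "uniform", rng)
thetas_gau = draw_aoas(90, 10, P, "gaussian", rng)
h_uni = multipath_channel(thetas_uni, path_loss, M, rng)
h_gau = multipath_channel(thetas_gau, path_loss, M, rng)
```

Gaussian draws are unbounded, so a few paths can fall outside any fixed interval, which is why this case cannot satisfy the bounded-support condition of Theorem 1.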

Two performance metrics are used to evaluate the proposed 
channel estimation scheme. The first one is a normalized 
channel estimation error 



$$\mathrm{err} \triangleq 10 \log_{10} \left( \frac{\sum_{j=1}^{L} \|\hat{\mathbf{h}}_{jj} - \mathbf{h}_{jj}\|_F^2}{\sum_{j=1}^{L} \|\mathbf{h}_{jj}\|_F^2} \right), \tag{34}$$

where $\mathbf{h}_{jj}$ and $\hat{\mathbf{h}}_{jj}$ are the desired channel at the $j$-th
base station and its estimate respectively. Note that we only
consider the estimation error of the desired channel. The
second performance metric is the per-cell rate of the downlink
obtained assuming standard MRC beamformer based on the
channel estimates. The beamforming weight vector of the $j$-th
base station is $\mathbf{w}_j^{\mathrm{MRC}} = \hat{\mathbf{h}}_{jj}$. We define the per-cell rate as
follows:

$$C \triangleq \frac{1}{L} \sum_{j=1}^{L} \log_2(1 + \mathrm{SINR}_j),$$

where $\mathrm{SINR}_j$ is the received signal-to-noise-plus-interference
ratio (SINR) by the scheduled user in the $j$-th cell.
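Both metrics are straightforward to compute once channel estimates and per-cell SINR values are available. The sketch below uses hypothetical function names and takes the SINR values as given inputs, since the SINR expression itself is not spelled out in this section.

```python
import numpy as np

def normalized_error_db(h_true, h_est):
    """Normalized channel estimation error (34), in dB, over the desired channels of all cells."""
    num = sum(np.linalg.norm(he - ht) ** 2 for ht, he in zip(h_true, h_est))
    den = sum(np.linalg.norm(ht) ** 2 for ht in h_true)
    return 10 * np.log10(num / den)

def per_cell_rate(sinr):
    """Per-cell rate: average of log2(1 + SINR_j) over the L cells (linear-scale SINRs)."""
    return float(np.mean(np.log2(1 + np.asarray(sinr, dtype=float))))

# Small worked example with made-up numbers.
h_true = [np.array([1.0 + 0j, 0.0]), np.array([0.0, 2.0 + 0j])]
h_est = [1.1 * h for h in h_true]          # every estimate is 10% too large
err_db = normalized_error_db(h_true, h_est)
rate = per_cell_rate([1.0, 3.0])
```

With a uniform 10% amplitude error the normalized error is $10 \log_{10}(0.1^2) = -20$ dB, and SINRs of 1 and 3 give a per-cell rate of $(1 + 2)/2 = 1.5$ bits per channel use.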

Numerical results of the proposed channel estimation 
scheme are now shown. In the figures, "LS" stands for conven- 
tional LS channel estimation. "CB" denotes the Covariance- 
aided Bayesian estimation (without coordinated pilot as- 
signment), and "CPA" is the proposed Coordinated Pilot 
Assignment-based Bayesian estimation. 



[Figure 1. Legend entries: Conventional LS Estimation; Covariance-aided Bayesian (CB) Estimation; LS - Interference Free Scenario; CB - Interference Free Scenario.]

Fig. 1. Estimation MSE vs. BS antenna number, 2-cell network, fixed
positions of two users, uniformly distributed AOAs with $\theta_\Delta = 20$ degrees,
non-overlapping multipath.

We first validate Theorem 1 in Fig. 1 with a 2-cell network,
where the two users' positions are fixed. AOAs of desired
channels are uniformly distributed with a mean of 90 degrees,
and the angle spreads of all channels are 20 degrees, yielding
no overlap between desired and interfering multipaths. The
pilot contamination is quickly eliminated with a growing number
of antennas.

In Fig. 2 and Fig. 3, the estimation MSEs versus the BS
antenna number are illustrated. When the AOAs have uniform
distributions with $\theta_\Delta = 10$ degrees, as shown in Fig. 2, the
performance of the CPA estimator improves quickly as $M$ grows
from 2 to 10. In the 2-cell network, CPA is able to avoid
the overlap between AOAs for the desired and interference
channels. For comparison, Fig. 3 is obtained with a Gaussian
AOA distribution. We can observe that a gap remains between the
CPA and the interference-free scenario, due to the non-boundedness
of the Gaussian PDF. Nevertheless, the gains over the classical
estimator remain substantial.

We then examine the impact of the standard deviation $\sigma$ of
Gaussian AOAs on the estimation. Fig. 4 shows that the
estimation error is a monotonically increasing function of $\sigma$.
In contrast, an angle spread tending toward zero will cause
the channel direction to collapse into a deterministic quantity,
yielding large gains for covariance-based channel estimation.

Fig. 2. Estimation MSE vs. antenna number, uniformly distributed AOAs
with $\theta_\Delta = 10$ degrees, 2-cell network.

Fig. 3. Estimation MSE vs. antenna number, Gaussian distributed AOAs
with $\sigma = 10$ degrees, 2-cell network.



Figs. 5 and 6 depict the downlink per-cell rate achieved
by the MRC beamforming strategy and suggest large gains
when the Bayesian estimation is used in conjunction with
the proposed coordinated pilot assignment strategy and
intermediate gains when it is used alone. The rate
performance almost saturates with $M$ in the classical LS case
(due to pilot contamination), whereas it increases quickly with
$M$ for the proposed estimators, indicating that the full benefits of
massive MIMO systems are exploited.

VI. Discussions 

In this paper, we assumed the individual covariance matrices
can be estimated separately. This could be done in practice by
exploiting resource blocks where the desired user and
interference users are known to be assigned at different times. In
future networks, one may imagine a specific training design for
learning second-order statistics. Since covariance information
varies much slower than fast fading, such training may not
consume a substantial amount of resources.

Fig. 4. Estimation MSE vs. standard deviation of Gaussian distributed AOAs
with $M = 10$, 7-cell network.

Fig. 5. Per-cell rate vs. antenna number, 2-cell network, Gaussian distributed
AOAs with $\sigma = 10$ degrees.

Fig. 6. Per-cell rate vs. standard deviation of AOA (Gaussian distribution)
with $M = 10$, 7-cell network.

The proposed coordinated estimation method would intro- 
duce information exchange between base stations. Although 
the second-order statistics vary much slower than the instan- 
taneous CSI, base stations still have to update the covariance 
information every now and then so as to maintain performance. 
Clearly, the overhead depends on the degree of user mobility. 

VII. Conclusions 

This paper proposes a covariance-aided channel estimation
framework in the context of interference-limited multi-cell
multiple antenna systems. We develop Bayesian estimators and
demonstrate analytically the efficiency of such an approach for
large-scale antenna systems, leading to a complete removal
of pilot contamination effects when the covariance matrices
satisfy a certain non-overlapping condition on their dominant
subspaces. We suggest a coordinated pilot assignment strategy
that helps shape covariance matrices toward satisfying the
needed condition and show channel estimation performance
close to interference-free scenarios.



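Appendix

A. Proof of Lemma 1
Define the series

$$x_i \triangleq -1 + \frac{2(i-1)}{M}, \quad i = 1, \ldots, M,$$

and

$$\boldsymbol{\mu}_i \triangleq \frac{\boldsymbol{\alpha}(x_i)}{\sqrt{M}}, \quad i = 1, \ldots, M.$$

Then we have $\boldsymbol{\mu}_i \in \mathcal{A}$, $\forall i = 1, \ldots, M$, and

$$\boldsymbol{\mu}_k^H \boldsymbol{\mu}_i = \frac{1 - e^{-j2\pi(i-k)}}{M\left(1 - e^{-j\frac{2\pi(i-k)}{M}}\right)} = 0, \quad k \neq i.$$

Thus $\{\boldsymbol{\mu}_i \mid i = 1, \ldots, M\}$ forms an orthogonal basis of $\mathcal{A}$, and
therefore

$$\dim\{\mathcal{A}\} = M.$$

Define

$$\bar{\mathcal{B}} \triangleq \left\{ \boldsymbol{\mu}_i \,\middle|\, i \in \mathbb{Z} \cap \left[ \left\lceil \frac{M(b_1+1)}{2} + 1 \right\rceil, \left\lfloor \frac{M(b_2+1)}{2} + 1 \right\rfloor \right] \right\},$$

where $\lceil x \rceil$ and $\lfloor x \rfloor$ are rounded-above and rounded-below
operators respectively. Then $\bar{\mathcal{B}}$ is part of an orthogonal basis
of the space $\mathcal{B}$, which indicates $\dim\{\mathcal{B}\} \geq |\bar{\mathcal{B}}|$. By counting
vectors in $\bar{\mathcal{B}}$, we have that

$$\dim\{\mathcal{B}\} \geq \left\lfloor \frac{M(b_2+1)}{2} \right\rfloor - \left\lceil \frac{M(b_1+1)}{2} \right\rceil + 1. \quad (35)$$

Now we define

$$\mathcal{C} \triangleq \left\{ \boldsymbol{\mu}_i \,\middle|\, i \in \mathbb{Z} \text{ and } i \in \left[ 1, \left\lfloor \frac{M(b_1+1)}{2} + 1 \right\rfloor \right] \cup \left[ \left\lceil \frac{M(b_2+1)}{2} + 1 \right\rceil, M \right] \right\}.$$

Then $\mathcal{C}$ is part of an orthogonal basis of $\mathcal{A}$. Furthermore,

$$|\mathcal{C}| = \left\lfloor \frac{M(b_1+1)}{2} + 1 \right\rfloor + M - \left\lceil \frac{M(b_2+1)}{2} + 1 \right\rceil + 1.$$

Consider the equivalent form of $\mathcal{B}$:

$$\mathcal{B} = \left\{ \int_{b_1}^{b_2} f(x) \boldsymbol{\alpha}(x) \, dx \,\middle|\, \forall |f(x)| < \infty, \; x \in [b_1, b_2] \right\}.$$

Take any vector $\boldsymbol{\mu}_i \in \mathcal{C}$; we have

$$\boldsymbol{\mu}_i^H \int_{b_1}^{b_2} f(x) \boldsymbol{\alpha}(x) \, dx = \frac{1}{\sqrt{M}} \int_{b_1}^{b_2} f(x) \boldsymbol{\alpha}(x_i)^H \boldsymbol{\alpha}(x) \, dx = \frac{1}{\sqrt{M}} \int_{b_1}^{b_2} f(x) \frac{1 - e^{-j\pi M (x - x_i)}}{1 - e^{-j\pi (x - x_i)}} \, dx.$$

Since $\boldsymbol{\mu}_i \in \mathcal{C}$, we can observe $x_i \notin [b_1, b_2]$, thus

$$\lim_{M \to \infty} \boldsymbol{\mu}_i^H \int_{b_1}^{b_2} f(x) \boldsymbol{\alpha}(x) \, dx = 0.$$

Therefore $\mathcal{C} \subset \mathcal{B}^{\perp}$ when $M \to \infty$. Hence we have

$$\dim\{\mathcal{B}^{\perp}\} = M - \dim\{\mathcal{B}\} \geq |\mathcal{C}|$$

$$\Rightarrow \dim\{\mathcal{B}\} \leq \left\lceil \frac{M(b_2+1)}{2} \right\rceil - \left\lfloor \frac{M(b_1+1)}{2} \right\rfloor - 1. \quad (36)$$

Combining (35) and (36), we can easily obtain

$$\dim\{\mathcal{B}\} = \frac{M(b_2 - b_1)}{2} + o(M),$$

and Lemma 1 is proved. $\square$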
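The orthogonality step in the proof of Lemma 1 can be checked numerically. The sketch below assumes the Lemma 1 definition $\boldsymbol{\alpha}(x) = [1, e^{-j\pi x}, \ldots, e^{-j\pi(M-1)x}]^T$, which is consistent with the geometric sum used in the proof. The function names are hypothetical. It also counts how many grid points $x_i$ fall inside $[b_1, b_2]$, which is the counting step behind the $(b_2 - b_1)M/2$ scaling.

```python
import cmath
import math


def alpha(x, M):
    """alpha(x) = [1, e^{-j*pi*x}, ..., e^{-j*pi*(M-1)*x}] (assumed form)."""
    return [cmath.exp(-1j * math.pi * m * x) for m in range(M)]


def mu(i, M):
    """mu_i = alpha(x_i) / sqrt(M), with x_i = -1 + 2(i-1)/M and i = 1..M."""
    x_i = -1.0 + 2.0 * (i - 1) / M
    return [v / math.sqrt(M) for v in alpha(x_i, M)]


def inner(u, v):
    """Hermitian inner product u^H v."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def max_gram_deviation(M):
    """Largest |mu_k^H mu_i - delta_{ki}| over all index pairs (k, i)."""
    basis = [mu(i, M) for i in range(1, M + 1)]
    worst = 0.0
    for k in range(M):
        for i in range(M):
            target = 1.0 if i == k else 0.0
            worst = max(worst, abs(inner(basis[k], basis[i]) - target))
    return worst


def grid_points_in(b1, b2, M):
    """Number of grid points x_i (i = 1..M) that lie inside [b1, b2]."""
    return sum(1 for i in range(1, M + 1)
               if b1 <= -1.0 + 2.0 * (i - 1) / M <= b2)
```

A deviation at the level of floating-point round-off from `max_gram_deviation` confirms that the $\boldsymbol{\mu}_i$ are orthonormal, and `grid_points_in(b1, b2, M)` stays within one of $M(b_2 - b_1)/2$ for the tested interval.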

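B. Proof of Lemma 2
We define

$$\mathbf{b}(x) \triangleq \mathbf{a}\left(\cos^{-1}\left(\frac{x\lambda}{D}\right)\right), \quad x \in \left[-\frac{D}{\lambda}, \frac{D}{\lambda}\right]. \quad (37)$$

It is clear from Lemma 1 that $\mathbf{b}(x) = \boldsymbol{\alpha}(2x)$. Hence, for any
interval $[x^{\min}, x^{\max}]$ in $[-\frac{D}{\lambda}, \frac{D}{\lambda}]$,

$$\dim\left\{\mathrm{span}\left\{\mathbf{b}(x), \forall x \in [x^{\min}, x^{\max}]\right\}\right\} \sim (x^{\max} - x^{\min}) M \quad \text{when } M \text{ is large.} \quad (38)$$

Additionally, for $i = 1, \ldots, L$, we have

$$\mathrm{span}\{\mathbf{R}_i\} = \mathrm{span}\left\{ \int_{0}^{\pi} \mathbf{a}(\theta) \mathbf{a}(\theta)^H p_i(\theta) \, d\theta \right\}.$$

Thus, due to the bounded support of $p_i(\theta)$, we can obtain

$$\mathrm{span}\{\mathbf{R}_i\} = \mathrm{span}\left\{ \int_{\theta_i^{\min}}^{\theta_i^{\max}} \mathbf{a}(\theta) \mathbf{a}(\theta)^H p_i(\theta) \, d\theta \right\} = \mathrm{span}\left\{ \int_{\theta_i^{\min}}^{\theta_i^{\max}} \mathbf{b}\left(\frac{D}{\lambda}\cos(\theta)\right) \mathbf{b}^H\left(\frac{D}{\lambda}\cos(\theta)\right) p_i(\theta) \, d\theta \right\}.$$

Then, by interpreting the integral as a (continuous) sum, we
have

$$\mathrm{span}\{\mathbf{R}_i\} \subset \mathrm{span}\left\{ \mathbf{b}(x), \forall x \in \left[ \frac{D}{\lambda}\cos(\theta_i^{\max}), \frac{D}{\lambda}\cos(\theta_i^{\min}) \right] \right\}.$$

From (38), we obtain

$$\mathrm{rank}(\mathbf{R}_i) \leq \left( \cos(\theta_i^{\min}) - \cos(\theta_i^{\max}) \right) \frac{D}{\lambda} M$$

for large $M$, and Lemma 2 is proved. $\square$

C. Proof of Lemma 3
Take an angle $\Phi \notin [\theta_i^{\min}, \theta_i^{\max}]$ and define

$$\mathbf{u} \triangleq \frac{\mathbf{a}(\Phi)}{\sqrt{M}}.$$

Then we have

$$\mathbf{u}^H \mathbf{R}_i \mathbf{u} = \frac{1}{M} \mathbf{a}(\Phi)^H \mathbf{R}_i \mathbf{a}(\Phi)$$

$$= \frac{1}{M} \mathbf{a}^H(\Phi) \, E\left\{ \mathbf{a}(\theta) \mathbf{a}^H(\theta) \right\} \mathbf{a}(\Phi)$$

$$= \frac{1}{M} E\left\{ \left| \mathbf{a}^H(\Phi) \mathbf{a}(\theta) \right|^2 \right\}$$

$$= \frac{1}{M} E\left\{ \left| \sum_{m=0}^{M-1} e^{2\pi j (m-1) \frac{D}{\lambda} (\cos(\Phi) - \cos(\theta))} \right|^2 \right\}$$

$$= \int_{\theta_i^{\min}}^{\theta_i^{\max}} \left| \frac{1}{\sqrt{M}} \sum_{m=0}^{M-1} e^{2\pi j (m-1) \frac{D}{\lambda} (\cos(\Phi) - \cos(\theta))} \right|^2 p_i(\theta) \, d\theta.$$

According to the well-known result on the sum of geometric
series, we can easily obtain

$$\lim_{M \to \infty} \left| \frac{1}{\sqrt{M}} \sum_{m=0}^{M-1} e^{2\pi j (m-1) \frac{D}{\lambda} (\cos(\Phi) - \cos(\theta))} \right|^2 = 0,$$

since $\Phi \neq \theta$, $\forall \theta \in [\theta_i^{\min}, \theta_i^{\max}]$. Thus

$$\lim_{M \to \infty} \mathbf{u}^H \mathbf{R}_i \mathbf{u} = 0,$$

which proves Lemma 3. $\square$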
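The limit used in the proof of Lemma 3 can be illustrated numerically. The sketch below assumes a uniform linear array steering vector with entries $e^{-2\pi j m \frac{D}{\lambda}\cos(\theta)}$, $m = 0, \ldots, M-1$, with $D/\lambda = 0.5$, and a uniform AOA density on $[\theta^{\min}, \theta^{\max}]$. These choices, the function names, and the midpoint-rule integration are assumptions made only for illustration.

```python
import cmath
import math


def steering(theta, M, d_over_lambda=0.5):
    """Assumed ULA steering vector: a_m = exp(-2j*pi*m*(D/lambda)*cos(theta))."""
    return [cmath.exp(-2j * math.pi * m * d_over_lambda * math.cos(theta))
            for m in range(M)]


def normalized_leakage(phi, theta, M, d_over_lambda=0.5):
    """(1/M) * |a(phi)^H a(theta)|^2, the integrand of u^H R u."""
    a_phi = steering(phi, M, d_over_lambda)
    a_theta = steering(theta, M, d_over_lambda)
    ip = sum(x.conjugate() * y for x, y in zip(a_phi, a_theta))
    return abs(ip) ** 2 / M


def quadratic_form(phi, theta_min, theta_max, M, n_grid=200, d_over_lambda=0.5):
    """Midpoint-rule estimate of u^H R u for AOAs uniform on [theta_min, theta_max]."""
    step = (theta_max - theta_min) / n_grid
    total = 0.0
    for k in range(n_grid):
        theta = theta_min + (k + 0.5) * step
        total += normalized_leakage(phi, theta, M, d_over_lambda)
    # Uniform density 1/(theta_max - theta_min) turns the integral into a mean.
    return total / n_grid


# Hypothetical setting: AOA support of 80 to 100 degrees, test angle at 30 degrees.
phi = math.radians(30.0)
lo, hi = math.radians(80.0), math.radians(100.0)
leak_large_array = quadratic_form(phi, lo, hi, M=128)
```

With these values the quadratic form is small for $M = 128$, whereas an angle inside the support gives `normalized_leakage(theta, theta, M)` equal to $M$. This matches the statement that directions outside the AOA support fall into the null space of $\mathbf{R}_i$ asymptotically.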